NSF PAR Search | NSF Public Access Repository

Note: When clicking on a Digital Object Identifier (DOI) number, you will be taken to an external site maintained by the publisher. Some full text articles may not yet be available without a charge during the embargo (administrative interval).
What is a DOI Number?

Some links on this page may take you to non-federal websites. Their policies may differ from this site.

SCAR: Scheduling Multi-Model AI Workloads on Heterogeneous Multi-Chiplet Module Accelerators

https://doi.org/10.1109/MICRO61859.2024.00049

Odema, Mohanad; Chen, Luke; Kwon, Hyoukjun; Al_Faruque, Mohammad Abdullah (November 2024, IEEE)

Full Text Available
MaGNAS: A Mapping-Aware Graph Neural Architecture Search Framework for Heterogeneous MPSoC Deployment

https://doi.org/10.1145/3609386

Odema, Mohanad; Bouzidi, Halima; Ouarnoughi, Hamza; Niar, Smail; Al Faruque, Mohammad Abdullah (October 2023, ACM Transactions on Embedded Computing Systems)

Graph Neural Networks (GNNs) are becoming increasingly popular for vision-based applications due to their intrinsic capacity in modeling structural and contextual relations between various parts of an image frame. On another front, the rising popularity of deep vision-based applications at the edge has been facilitated by the recent advancements in heterogeneous multi-processor Systems on Chips (MPSoCs) that enable inference under real-time, stringent execution requirements. By extension, GNNs employed for vision-based applications must adhere to the same execution requirements. Yet contrary to typical deep neural networks, the irregular flow of graph learning operations poses a challenge to running GNNs on such heterogeneous MPSoC platforms. In this paper, we propose a novel unifieddesign-mappingapproach for efficient processing of vision GNN workloads on heterogeneous MPSoC platforms. Particularly, we develop MaGNAS, a mapping-aware Graph Neural Architecture Search framework. MaGNAS proposes a GNN architectural design space coupled with prospective mapping options on a heterogeneous SoC to identify model architectures that maximize on-device resource efficiency. To achieve this, MaGNAS employs a two-tier evolutionary search to identify optimalGNNsandmappingpairings that yield the best performance trade-offs. Through designing a supernet derived from the recent Vision GNN (ViG) architecture, we conducted experiments on four (04) state-of-the-art vision datasets using both (i) a real hardware SoC platform (NVIDIA Xavier AGX) and (ii) a performance/cost model simulator for DNN accelerators. Our experimental results demonstrate that MaGNAS is able to provide1.57× latency speedup and is3.38× more energy-efficient for several vision datasets executed on the Xavier MPSoC vs. the GPU-only deployment while sustaining an average0.11%accuracy reduction from the baseline.
more » « less
Full Text Available
SEO: Safety-Aware Energy Optimization Framework for Multi-Sensor Neural Controllers at the Edge

https://doi.org/10.1109/DAC56929.2023.10247751

Odema, Mohanad; Ferlez, James; Shoukry, Yasser; Al Faruque, Mohammad Abdullah (July 2023, IEEE)

Full Text Available
Map-and-Conquer: Energy-Efficient Mapping of Dynamic Neural Nets onto Heterogeneous MPSoCs

https://doi.org/10.1109/DAC56929.2023.10247722

Bouzidi, Halima; Odema, Mohanad; Ouarnoughi, Hamza; Niar, Smail; Al Faruque, Mohammad Abdullah (July 2023, IEEE)

Full Text Available
PrivyNAS: Privacy-Aware Neural Architecture Search for Split Computing in Edge-Cloud Systems

https://doi.org/10.1109/JIOT.2023.3311761

Odema, Mohanad; Faruque, Mohammad Abdullah (January 2023, IEEE Internet of Things Journal)

Full Text Available
EnergyShield: Provably-Safe Offloading of Neural Network Controllers for Energy Efficiency

https://doi.org/10.1145/3576841.3585935

Odema, Mohanad; Ferlez, James; Vaisi, Goli; Shoukry, Yasser; Al Faruque, Mohammad Abdullah (May 2023, ACM)

Full Text Available
HADAS: Hardware-Aware Dynamic Neural Architecture Search for Edge Performance Scaling

https://doi.org/10.23919/DATE56975.2023.10137095

Bouzidi, Halima; Odema, Mohanad; Ouarnoughi, Hamza; Al Faruque, Mohammad Abdullah; Niar, Smail (April 2023, IEEE)

Full Text Available
Romanus: Robust Task Offloading in Modular Multi-Sensor Autonomous Driving Systems

https://doi.org/10.1145/3508352.3549356

Chen, Luke; Odema, Mohanad; Faruque, Mohammad Abdullah (October 2022, ICCAD '22: Proceedings of the 41st IEEE/ACM International Conference on Computer-Aided Design)

Full Text Available
Testudo: Collaborative Intelligence for Latency-Critical Autonomous Systems

https://doi.org/10.1109/TCAD.2022.3211480

Odema, Mohanad; Chen, Luke; Levorato, Marco; Faruque, Mohammad Abdullah (October 2022, IEEE Transactions on Computer-Aided Design of Integrated Circuits and Systems)

Full Text Available
SAGE: A Split-Architecture Methodology for Efficient End-to-End Autonomous Vehicle Control

https://doi.org/10.1145/3477006

Malawade, Arnav; Odema, Mohanad; Lajeunesse-degroot, Sebastien; Al Faruque, Mohammad Abdullah (October 2021, ACM Transactions on Embedded Computing Systems)

Autonomous vehicles (AV) are expected to revolutionize transportation and improve road safety significantly. However, these benefits do not come without cost; AVs require large Deep-Learning (DL) models and powerful hardware platforms to operate reliably in real-time, requiring between several hundred watts to one kilowatt of power. This power consumption can dramatically reduce vehicles’ driving range and affect emissions. To address this problem, we propose SAGE: a methodology for selectively offloading the key energy-consuming modules of DL architectures to the cloud to optimize edge, energy usage while meeting real-time latency constraints. Furthermore, we leverage Head Network Distillation (HND) to introduce efficient bottlenecks within the DL architecture in order to minimize the network overhead costs of offloading with almost no degradation in the model’s performance. We evaluate SAGE using an Nvidia Jetson TX2 and an industry-standard Nvidia Drive PX2 as the AV edge, devices and demonstrate that our offloading strategy is practical for a wide range of DL models and internet connection bandwidths on 3G, 4G LTE, and WiFi technologies. Compared to edge-only computation, SAGE reduces energy consumption by an average of 36.13% , 47.07% , and 55.66% for an AV with one low-resolution camera, one high-resolution camera, and three high-resolution cameras, respectively. SAGE also reduces upload data size by up to 98.40% compared to direct camera offloading.
more » « less
Full Text Available

« Prev Next »

Search for: All records